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Abstract 

Proposed extensions of the ML type system to incorporate global 
overloading include the systems of [Kae88, CD091, Smi91, Kae92, 
Jon92] and those related to the design of the functional programming 
language Haskell [WaB89, CH092, NiP93]. These systems have in 
common the notion of a constrained type scheme which in some is 
realized by type kinds and in others as explicit predicates. An analysis 
of these type systems reveals that some are unsound with regard to 
a suitable criterion for typability and some adopt a notion of type 
generality that is inconsistent with that of system ML [DaM82]. 



1 Introduction 

Much of the research in type theory is devoted to finding new type systems 
that offer programmers more flexibility without compromising the advantages 
of a strongly-typed language. One of the simplest type systems is Ao, the 
implicit, finitely- typed lambda calculus [Tiu90, BaH90]. It assigns finite 
types to type-free A terms. For example, in Ao one cam derive the typing 
h Az. x : a —* a. In fact, one can derive h Az. z : r — ► r for any finite type 
r. Consequently, Az. z can be assigned multiple types within a single term 
as is necessary in deriving a type for (Az.z) Az.z in Aq. 
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Although A 0 allows a term denoting a polymorphic value to be assigned 
multiple types, it does not allow a free identifier that denotes such a term 
to be given multiple types within a single A term. For example, the term 
(A y. yy) Ax. x is untypable in Ao even though y , which is free in yy, denotes 
Ax.x. So although Ao has desirable properties like a decidable typability 
problem and principal types for all typable terms, it is for the most part 
unsuitable for direct use in a programming language. 

This limitation was overcome in the ML type system [Mil78, DaM82] 
which is an extension of Ao with type schemes. A type scheme represents a 
very simple kind of polymorphism called parametric polymorphism. A free 
identifier may be asserted to have infinitely many types via a type scheme. 
However, in the interest of retaining principal types, lambda abstraction 
in system ML is monomorphic as in Ao- Thus (Ay. y y) Ax. x is untypable in 
system ML as well. How then does system ML allow free identifiers denoting 
polymorphic values to be assigned multiple types? Only through its let 
construct. So for example, (Ay. yy) Ax.x is written let y = Ax. x in yy. 

Like Ao, system ML has a decidable typability problem and typable terms 
have principal types. Further, it seems to offer an acceptable level of flexibil- 
ity as evidenced by its incorporation in languages like Standard ML [HMM86] 
and Miranda [Tur86]. Yet it remains limited in some ways that appear to be 
solely of theoretical interest, and in others that are very obvious in practice. 

One practical limitation is that it prohibits overloading by restricting to 
at most one the number of assumptions per identifier in a type assumption 
set, a limitation noted by Milner himself in [Mil78]. Suppose we wish to 
assert that a free identifier, say +, has precisely finite types int — ► int — ► int 
and real — ► real —* real. Any assumption set in which + has one of the two 
desired finite types precludes a derivation that it has the other. On the other 
hand, any assumption set that assigns type scheme Va.a — ► a — ► a to + 
is one from which too many types can be derived for +. Thus there is no 
assumption set in system ML from which we can derive precisely the desired 
finite types for +. Even system ML with subtypes is inadequate. From 

A = {int C real , -f : real — + real — + real} 

one could derive A F + : int — *• int — ► real but not A l - + : int — ► int — ► int . 

Many extensions of system ML have been proposed to incorporate over- 
loading. There are the systems of [Kae88, CD091, Smi91, Kae92, Jon92] and 
those related to the design of the functioned programming language Haskell 
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[WaB89, CH092, NiP93j. This paper reviews these systems. As we shall 
see, some axe unsound under a proper notion of typability and some even 
adopt a notion of type generality that is inconsistent with the original notion 
proposed in [DaM82]. 

2 Constrained Type Schemes 

The terms of the type systems we consider are typical. Every variable x 
is a term, and if M and N are terms then so are A x.M, (M N), and 
let x — M in N. There is no expression that overloads identifiers locally 
like the over expression proposed in [WaB89]. The only overloading the sys- 
tems attempt to address is that which comes from an initial basis or sequence 
of top-level definitions, which we call global overloading. 

Each system has the same set of finite types defined in the usual way. 
Every type variable a is a finite type, and if r x , . . . , r n are finite types then 
so axe T\ — ► t 2 and x( T i > • • • > r n) where x is a type constructor of arity n. 

The systems have in common the notion of a constrained type scheme 
which in some is realized by a type of types called a kind and in others as an 
explicit predicate. A kind is a universe of types over which a type variable 
may be quantified. Kinds appear in [Kae88, CD091, CH092, NiP93] where 
the general form of a constrained type scheme is Va : p. a for some kind p. 
Explicit predicates appear in the systems of [WaB89, Smi91, Jon92, Kae92] 
where the general form of a constrained type scheme is 

Votj, . . . , OC n with Xj . Tj, . . . , X m . T m . T 

for predicates x x : r x ,...,x m : r m . Predicates also appeared earlier in the 
work of Holmstrom [Hol83]. 

Although syntactically different, constrained type schemes of all these 
systems may be interpreted a 5 Horn clauses. For example, suppose multipli- 
cation (*) and the multiplicative identity (1) are overloaded in a type basis. 
Then if both were free in a definition of expon that raises a value to some 
integer power then expon should have as types r — ► int — ► r for any finite 
type r for which predicates 1 : r and * : r — ► r — ► r are true. This may be 
expressed by a constrained type scheme using predicates as 

expon : Va with 1 : a, * : a —*• a — ► a . a — ► int — ► a . 
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If T x (t) means x : r then this is interpreted as the Horn clause 
T eX p 0n {a — ► int — ♦ q) «— T,(a) A jT.(a — ► a — *• a) . 

It is natural to consider Horn clauses with other kinds of predicates as 
well expressing, for example, subtype inclusions [Smi91, Jon92, Kae92] and 
even unification among finite types [Kae92]. The latter kind of predicate 
shows that typability in system ML is actually an instance of typability in 
a predicate-based system. Of course in system ML, the absence of a unifier 
implies the given term is untypable and if unification succeeds, then a uni- 
fying substitution can be reflected in an unconstrained type scheme, so no 
explicit predicates or kinds are needed. 

3 A Criterion for Typability 

When is a term typable with respect to a set of type assumptions? One cri- 
terion for typability often used for system ML is called strong type inference 
[Tiu90]. Under it, a term M is typable with respect to an assumption set B 
if there is an assumption set A and type a such that B C A and A h M : a 
is derivable. Thus in a formulation of the typability problem, M and B are 
considered input. Yet the problem only requires finding an A that contains 
B so there is considerable latitude in the choice of A. In fact, within the 
context of overloading, there is too much. Consider the term true + true, 
where true denotes a truth value and + integer addition as conveyed by the 
assumption set 

B = {true : bool, + : int —* int — ► int} . 

Then the term could be typable under B in a sound type system since with 
A = B U { + : bool — ► bool — ♦ bool } 

we can derive A h true + true : bool. But B h true + true : bool is 
not derivable, since + does not have an instance for type bool within B. 
Therefore the term should not be typable under B. 

Thus a suitable criterion for typability is strong type inference with the 
requirement that B \~ M : cr be derivable. However <x is constrained so unless 
a type system prevents quantification over an empty type universe, it may 
be unsound in that terms with domain incompatibilities are typable. 
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4 Soundness of the Proposed Systems 



The early work of Kaes [Kae88] led to an extension of system ML based 
on type kinds. It permits a restricted form of overloading called parametric 
overloading which, when extended to allow overloading of constants, corre- 
sponds to the kind allowed in Haskell. Kaes gives a type inference algorithm 
where unification is done with variables having kinds. This was later discov- 
ered to be an application of order-sorted unification in [NiS91] where type 
inference in Haskell, which permits overloading via classes, is implemented 
in the context of order-sorted algebras. 

Kaes’s extension has two kinds of assumption sets, one for ordinary type 
assumptions and the other for overloading assumptions. For example, 



I f : Va {/} •<*{/} “*■ a {/}> \ 

{ g : Va {5} . a {j} -* a {ff} / 



is a type assumption set and 



0 = 



f 



f : (u —* u>, {int, real}), 
g : (ui —+ u>, {bool, char}) 



) 



is an overloading assumption set. Set 0 specifies the different finite types for 
the overloaded identifiers / and g. The types of /, for example, are formed 
by replacing uj in u — ► u> by int and real. 

A type variable may be annotated with a set of identifiers that effectively 
specifies its kind. The variable ranges only over those finite types for which 
all functions in the set are simultaneously defined. So a/, for example, ranges 
over types int and real , but not bool , with respect to 0 . 

Under the strong type inference criterion, a term M would be typable 
with respect to A and 0 in this system if 



A, 0 h M : cr for some type a. 



But the system is unsound under this criterion, for even though 0 specifies 
that / and g have no finite types in common, we can derive 

A,0\- \x. g(f x ) : Vor^}. -4 ay ig y 
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in the system: 



a {Ln} Q U.g } <« Vq {/}- q {/} — Q U) (<o) 

A U {x : a {/>5} }, Oh f: a {fig} -* a u>g} (var) 

<*{/.<?} -* a if.9} <0 Va {ah Q {9) a { 5 } (<o) 

A U {x : a {/>s} }, Oh g : a {]<g) — a {/>j} (var) 
^U{x: ct{j , g} } , 0 h g(f x) : q {/ , s} (app) 2 

A, Oh Ax. ^(/ x) : a {J<g] a {/ij} (abs) 

A, 0 h Ax. ^(/ x) : a{y, a } — ► (closure) 



In essence we can hypothesize overlap in the types of / and g through type 
ct{j, g } for the sake of showing A x.g(f x) is typable, yet we are not required 
to show that any overlap exists. Thus a type can be derived for the term 
under A and 0 even though the term has a type incompatibility. A sound 
variant of this system is given in [Rou90, Kae92]. 

Parametric overloading prevents certain kinds of overloadings like 

{ + : int —* real — ► real, 1 

+ : real — + complex —* complex j 

which is useful for describing mixed-mode arithmetic. In an attempt to 
permit more general kinds of overloadings, Wadler and Blott give a predicate- 
based extension of system ML [WaB89]. But like [Kae88], it too is unsound 
under strong type inference. Further, unlike Kaes’s system which restricts 
overloading, typability in their system under a revised criterion is undecidable 
[VoS91]. A restriction that allows overloadings for mixed-mode arithmetic 
and for which typability is decidable is given in [VoS91]. 

The type system ML, described in [Smi91] is sound. It replaces the type 
generalization and instantiation rules of ML by the following rules. 



(V-intro) 



AUChM-.T 1 , dhC[a :=f] 
Ah M : Va with C . r' 



(a not free in A) 



(V-elim) A h M : Vq with C . r', Ah C{a := f ] 
Ah M : r'[a := f] 
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To illustrate these rules, consider a term M = Ax. (x + x), with free identifier 
+, and a type assumption set B defined as follows. 



+ : int — ► int — ► int, 

B — { + : rea ^ rea ^ rea ^ 

< : char — ► char — ► bool, 

, < : Va with < : a — *• a — » 600/ . seg(a) — ► se^(or) — bool , 

We show a derivation of B h Af : Va with + :a-MJ->a.a^ain ML 0 . 



B U {+ : a — * a — ► a} U {x : a} h x : a 
B U {+ : a — * a — *-o;}U{x:a}l- + :a— * a — * a 
BU{+:a-*a-*a}U{x:Q}l"(x|x):a 
Bll{+:a - * a — »or}H Ax. (x + x) : a — * a 
B \~ {+ : a — * a — ♦ a} [a := real ] 

B 1- Ax. (x + x) : Vo; with + :a — * a — * a . a — * a. 



(hypoth) 
(hypoth) 
(— ^-elim) 2 
(— »-intro) 
(hypoth) 
(V-intro) 



The last step of the derivation discharges assumption + : a — > a — * a which 
was used to derive a type for M, making a type derivation possible from the 
initial set B. Before this could be done however, we needed to establish that 
the assumption is derivable with a instantiated to some finite type ( real was 
chosen in our example although int would have sufficed). For some terms 
and type assumptions, this is impossible. The term M = Ax. A y.y < (x + x) 
together with B is an example. We can hypothesize a type assumption set 
C in which + and < overlap on some finite type, say a, by defining C as 



C = {+ : a — ► or — *• a, <: a —* a — ► bool } 



and then derive bool. But the derivation must end 

here because there is no finite type for a that allows us to derive C from B , 
so M is untypable. 

The work of [CD091] is an effort to relax the restrictions of a parametric 
overloading. They give a kind-based extension of system ML that restricts re- 
cursive overloadings slightly less than parametricity and for which typability 
is decidable. Yet assumptions for mixed-mode arithmetic are still prohibited. 

The system however is sound under strong type inference. It is interesting 
to see how the system prevents a type from being derived for term A x.g(f x) 
when free identifiers / and g have no finite types in common. 
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Derivations are made with respect to a traditional type environment A 
and a kind environment T that maps type variables to kinds. There are two 
operators on kinds, fix and a union operator U. Kinds are partially ordered 
by a containment relation < such that JL < p for any kind p. 

The type instantiation and generalization rules of system ML are modified 
to accomodate constrained type schemes with kinds: 



(inst) 


T; Ah M : Vq : p.a 


(r(r)<p) 




T; A h M : cr[a := r] 


(gen) 


T, q : p; A h M : a 


(a not free in A A p ^ _L) 




T; A h M : Vq : p. a 



Here r(r) is finite type r with all free type variables replaced by the kinds 
prescribed for them by T. 

Suppose 

^ _ f / : Vo : int U real, a — ► a, 1 
[ g : V a : bool U char, a —* a J 

The only way we can hypothesize any overlap between / and g is through 
use of kind X since X < p for any kind p. Proceeding in this way yields 





X}; A U 


{x : 3} h x 


:/? 




(var) 


m = -l < 


int U real 






(<) 


{(3: 


_L}; AU 


{x: (3}Y- f 


: Vq: 


: int U real, a — ♦ q 


(var) 


{/?: 


X}; AU 


{x: (3}h f 


■(3^(3 


(inst) 


m = -L < 


bool U char 






(<) 


{/?: 


X); AU 


{*:/*}! ~ 9 


: Vq: 


bool U char, a — ► q 


(var) 


{?■ 


X}; AU 


{x : P) h g : 


= 0- 


■13 


(inst) 


{/?: 


X}; AU 


{x : /?} h g(f x) 


:/3 


( a PP)' 




X}; A h 


\x.g(f x ) : 


0^ 


13 


(abs) 



The generalization rule cannot be applied at this point to discharge the re- 
maining kind assumption {/? : _L} so A x.g(f x ) is untypable under A. 

The type system of [CH092] is another kind-based extension of system 
ML, that unlike [CD091], is unsound under strong type inference. This work 
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is not an attempt to relax the restrictions of parametric overloading but 
rather to formalize a type system for Haskell with parametric type classes. 

Derivations in [CH092] are also made with respect to a traditional type 
environment A and a kind environment, which they call a context G, mapping 
type variables to kinds. Kinds here are taken to be sets of class names T with 
each class having an optional finite type as parameter. So for example, if F 
and G are class names, then {a:: {F, G}} is a kind environment that asserts 
a is a class instance of F and G, meaning that a is a type over which the 
operations of both classes F and G are defined. 

The type instantiation and generalization rules of system ML are replaced 
by rules (V-elim) and (V-intro): 



(V-elim) 



(V-intro) 



A, C I- M : VanT. <r 
A, C b M : a[a := r] 

A, C,a::T\~ M : a 
A, Ch M : Va::I\<r 



(CH-r :: T) 

(a not free in A or C ) 



where C H- r::T is derived using a set of entailment inference rules with 
class instance declarations D as axioms. For example, if D = {int :: F} then 
0 H- int :: F. 

Suppose that 

D = {int :: F, real :: F, bool :: G , char :: G} 



which asserts that class F has as instances finite types int and real, and class 
G, types bool and char. So with respect to D, there is no finite type over 
which the operations of F and G are defined. But with 



A = 



/ : Va::{F}. a — ► a, ) 
g : Vq::{G}. a — ► a J 



we can derive 



Ab\x.g(fx):V0::{F,G}.P-+0 

in the system, thus establishing that A x.g(f x ) is typable with respect to A 
under the strong type inference criterion even though the derived type scheme 
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has no finite types as instances with respect to D. Using kind {F,G}, we 
hypothesize overlap between / and g yielding the derivation 



AU{x:/3}, {/3::{F,G}}hx:f3 


(var) 


{/?::{F,G}}4-/?::F 


Fe{F,G} 


{/3::{F,G}}H-/3::{F} 


(H-) 


A U {x : 13}, {/3 :: {F, G}} h f : Va::{F}. a - a 


(var) 


A U{x:(3}, {0::{F,G}}\-f:/3->/3 


(V-elim) 


{/3::{F,G}}\b(3::G 


Ge{F,G} 


{p::{F,G}}Yr(3::{G} 


(H-) 


A U {x : /?}, {/3 :: {F, G}} H g : Va::{G}. a - a 


(var) 


A\j{x:/3}, {(3::{F,G}}hg:/3^3 


(V-elim) 


A\J{x:/ 3}, {(3::{F,G}}hg(fx):i3 


(A-elim) 2 


A, {(3::{F,G}}\-Xx.g(fx):j3-I3 


(A-intro) 


A\-Xx.g(f x):V/3::{F,G}.!3->/3 


(V- intro) 



In the final step, the entire context is discharged even though there is no finite 
type that satisfies it with respect to D. An almost identical derivation can 
be constructed in the system of [NiP93] and in the predicate-based extension 
of system ML given in [Jon92]. With a minor reformulation of A and D 
requiring kinds to be interpreted as predicate sets so that for example D is 

{F int, F real, G bool, G char} 

viewing F and G as predicates, one can derive in the system of [Jon92] 

D\A h A x.g(f x) : Va.{F a, G a} => a — *• a. 

Although D specifies no overlap for / and g, we can prove the judgement. 



5 Type Generality 

In order to define in the systems considered here a notion of principal type 
among derivable types, a generic instance relation is needed that tells us 
whether one type is more general than another [DaM82]. In the context of 
constrained type schemes, this relation becomes a preorder rather than a 
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partial order since two distinct types can each be a generic instance of the 
other. 

Some of the proposed type systems require that in order for a constrained 
type scheme o' to be a generic instance of smother such scheme o, it must be 
possible to replace the bound variables of o by finite types so as to make the 
bodies of o and o' syntactically equal [WaB89, CH092, Kae92, Jon92]. This 
is an axtifact of the original relation of [DaM82], which for system ML, gives 
a way to check that o has as instances all finite types that are instances of 
o', or in other words, that o is more general than o' . However, the relation 
is also typically defined relative to some form of assumptions from which an 
attempt can be made to show that a finite type is an element of a kind or 
that a predicate is true. Now it is no longer appropriate to demand syntactic 
equality of type bodies in order to establish that one constrained type scheme 
is more general than another. 

For example, in the system of [CH092], constrained type scheme 
\/cr.:{F}.V/3::{F}. a - fi 
is not a generic instance of 
V7::{F’}. 7-^7 

with respect to any set of class instance declarations since there is no sub- 
stitution on 7 that when applied to 7 — ► 7 gives a — ► j3. Yet in the context 
of a single declaration, say {int ::F}, the latter type does indeed have as 
instances all finite types that axe instances of the former (only int — ► int ) 
and is therefore more general in this context. Similar examples can be given 
in the systems of [WaB89, Kae92, Jon92]. 
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